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NOVEL PROMOTER SEQUENCES OF MYOSTATIN GENE 

The present invention concerns the novel promoter sequences of the myostatin gene, 
DNA constructs comprising the novel promoter sequences operably linked to the coding 
5 sequence of a gene of interest, to vectors containing the promoter or construct and host 
cells containing such vectors. 

Background of the Invention 

10 A promoter region of a gene is one which controls the expression of the gene. The 
myostatin gene was isolated and sequenced and the sequence published (McPherron AC; 
Lawler AM and Lee SJ; Nature 387, 1 May 1997; W094/21681). However the sequence 
of the promoter region of this gene has not been isolated, sequenced or published to date. 

15 Myostatin, also known as growth and differentiating factor- 8 (GDF-8), is structurally 
related to the transforming growth factor p (TGF-P) superfamily (McPherron et al 1997). 
The myostatin gene has been shown to inhibit muscle growth and initial research 
concluded that targeted disruption of the myostatin gene in mice leads to a twofold 
increase in muscle mass (McPherron et al 1997) indicating that the myostatin gene is a 

20 negative regulator of muscle mass. 

Three of the present inventors (Kambadur, Sharma and Bass) have identified a mutation 
of the myostatin gene in "double muscled" Belgian Blue cattle comprising an llbp 
deletion (Kambadur et al 1997). This mutation negates the effect of myostatin and leads 
25 to a 30% or 40% increase in muscle mass. 

The inventors have further observed two Belgian Blue sire lines which possess the same 
Belgian blue allele, i.e. the 1 lbp deletion, but which is being expressed in very small 
amounts. On a northern blot, two bands were seen suggesting that in these animals there 
30 is a double mutation of the myostatin gene, ie hypomorphic alleles. This strongly suggests 
that a failure in the upstream regions of the myostatin gene have resulted in a lack of 
expression of myostatin and as a consequence increased muscle growth. 

Summary of the Invention 

35 

The present invention provides an isolated DNA molecule having a polynucleotide 
sequence of SEQ ID NO. 1 of Figure 1 and which encodes the promoter region of the 
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» 

bovine myostatin gene, or a fragment or variant of said polynucleotide sequence which 
has been modified by the insertion, substitution or deletion of one or more nucleotides 
having substantially equivalent function thereto. 

5 In a further aspect, the present invention provides an isolated DNA construct comprising 
the DNA molecule of the invention operably linked to a heterologous gene of interest 
such that the heterologous gene is under transcriptional control of the DNA molecule of 
the invention. 

1 0 The DNA molecule may have been isolated from a natural source or may comprise cDNA. 

In a further aspect the invention provides recombinant expression vectors which contain 
a DNA molecule of the invention as defined above, and/or the DNA construct of the 
invention, and hosts transformed with such vectors and capable of induction of the DNA 
15 molecule and/or of inducible expression of the heterologous gene of the DNA construct. 
In a further aspect the invention provides a method of producing the sequence encoded 
by the DNA molecule comprising culturing a host cell, transfecting said host cell with a 
vector containing the DNA molecule of the invention or a fragment or variant thereof and 
cloning the DNA sequence by known methods. 

20 

In still a further aspect, the invention may be said to consist in a method of diagnosing 
muscle cell disorders in an animal, including a human, using the myostatin gene promoter 
sequence as defined above, as a diagnostic probe. The diagnostic method may comprise 
the steps: 

25 

i) obtaining a tissue or blood sample from an animal or human; 

ii) isolating the DNA by known methods; 

30 iii) optionally isolating myostatin DNA; 

iv) probing said DNA with a probe complementary to the myostatin gene promoter 
sequence; 



35 



v) 



optionally amplifying the amount of myostatin promoter DNA using PCR 
technology; 
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vi) analysing the myostatin promoter sequence DNA obtained with the probe for any 
mutations which may result in muscle cell disorders; and 

» 

vii) diagnosing muscle cell disorders caused by myostatin gene promoter sequence 
anomalies. 

The probe may comprise a complementary sequence to a part or the whole of the 
nucleotide sequences of myostatin promoter DNA of SEQ ID NO. 1 . 

The probe is preferable genomic or cDNA. 

In still a further aspect, the invention consists in a method of selection of animal breeds 
that express low levels of myostatin. This method comprises the steps: 

i) obtaining a tissue or blood sample from an animal or human; 

ii) isolating the RNA by known methods; 

iii) . determining the transcript level (mRNA) of myostatin gene, using Northern blot 

or RT-PCR technology or RNase Protection Assay. 

Preferably the animal is a cow, bull, sheep, pig, human or any other mammal, poultry, fish 
or any other economically important livestock breed. 

In still a further aspect, the invention consists in a method of expressing a heterologous 
gene of interest specifically in muscle cells comprising the steps: 

i) isolating the coding sequence of said heterologous gene; 

ii) ligating said coding sequence of said heterologous gene to the myostatin promoter 
sequence so that said coding sequence is under the transcriptional control of said 
myostatin promoter; 

iii) inserting the ligated construct into a suitable expression vector; 

iv) introducing the expression vector into a muscle host cell; and 
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v) optionally measuring the expression of the heterologous gene by recovering the 
product. 

• ■ 

The method may be carried out in vitro in muscle cells in culture or in vivo in an animal 
including a human. 

In a still further aspect, the present invention provides a method of producing a 
heterologous polypeptide or peptide comprising the steps of: 

(a) culturing a host cell which has been transformed or transfected with a vector 
containing the DNA construct as defined above to express the heterologous 
polypeptide or peptide encoded by the heterologous gene; and optionally 

(b) recovering the expressed polypeptide or peptide. 

In a still further aspect, the present invention provides a non-human transgenic mammal 
that expresses a heterologous polypeptide or peptide in their muscle cells, said non-human 
mammals having been transfected with the DNA constructs of the present invention. 

The heterologous gene of interest may be selected from the group consisting of: 



a) 


Myogenic regulatory factors; 


b) 


Myostatin and Myostatin receptor; 


c) 


Oncogenes; 


d) 


Genes that regulate muscle growth or differentiation; 


e) 


Muscular dystrophy gene; and 


f) 


Any genes expressed in muscle. 



The expression vector may be selected from the group consisting of Eukaryotic vectors, 
retroviral vectors or any vectors that are used for gene therapy. 
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The muscle host cell may be selected from in vitro cell culture lines selected from the 
group consisting of any primary culture of myoblasts or transformed myoblasts, satellite 
cells or any cell culture where myostatin promoter is active. Alternatively the host muscle 
cell may comprise a skeletal muscle cell in vivo in a host animal. 

The host animal may be selected from the group comprising cow, bull, sheep, pig, horse, 
rat, mouse, poultry, fish or human. 

In a still further embodiment the present invention provides a method of expressing 
myostatin or antisense to myostatin or ribozymes or any foreign gene in a host muscle 
cell/ myoblast comprising the steps: preparing a construct comprising the myostatin 
promoter ligated to a coding sequence of interest, so that said coding sequence is under 
transcriptional control of the myostatin promoter, cloning said construct into gene therapy 
vectors by standard cloning procedure and transfecting them into a desired cell line or 
tissue of a live animal or human. 

Preferably the host muscle cell is selected from the group consisting of a skeletal muscle 
cell, a somite, a myoblast or any mesodermally derived cells. 

* 

In a still further aspect the present invention provides a method of expressing dominant 
negative forms of myostatin or of any gene of interest in a host muscle cell comprising 
the steps: cloning the myostatin promoter ligated to desired mutated or wild type gene 
wherein said gene is under the transcriptional control of said myostatin promoter, into 
gene therapy vectors by standard cloning procedures and transfecting them into a desired 
cell line or tissue of a live animal or human. 

Preferably the host muscle cell is selected from a skeletal muscle cell, a somite, a 
myoblast or myotube. 

The method may be carried out in vitro in the host muscle cell in culture or in vivo in a 
host animal. The host animal may be selected from the group consisting of cow, bull, 
sheep, pig, horse, rat, mouse, human, poultry or fish. 

This invention may also be said broadly to consist in the parts, elements and features 
referred to or indicated in the specification of the application, individually or collectively, 
and any or all combinations of any two or more of said parts, elements or features, and 
where specific integers are mentioned herein which have known equivalents in the art to 
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which this invention relates, such known equivalents are deemed to be incorporated herein 
as if individually set forth. 

The invention consists in the foregoing and also envisages constructions of which the 
following gives examples. 

Description of Figures 

One preferred form of the present invention will now be described with reference to the 
accompanying drawing in which: 

Figure 1 shows the promoter sequence of the myostatin gene isolated from bovine 

muscle (SEQ ID NO. 1) - indicating consensus sequences for basic 
functional elements of known transcription factor binding sites. 

Figure 2 shows the promoter activity of the promoter sequence of Figure 1. 

Definitions 

The term "isolated" means substantially separated or purified away from contaminating 
sequences in the cell or organism in which the nucleic acid naturally occurs and includes 
nucleic acids purified by standard purification techniques as well as nucleic acids prepared 
by recombinant technology and those chemically synthesised. 

The term "variant" as used herein refers to a DNA molecule wherein the nucleotide 
sequence is substantially identical to the nucleotide sequence set out in Figure 1 . The 
variant may be arrived at by modification of the nucleotide sequence of the DNA 
molecule by such modifications as insertion, substitution or deletion of one or more 
nucleic acids, such modifications comprising neutral mutations which do not affect the 
functioning of the DNA molecule. 

The term "substantial sequence identity" means that two nucleotide sequences, when 
optimally aligned, such as by the programs Clustal W using default gap weights, share at 
least 60 per cent sequence identity, preferably at least 80 per cent sequence identity, more 
preferably at least 90 per cent sequence identity and most preferably at least 95 percent 
sequence identity or more. 
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The term "DNA construct" means a construct incorporating the nucleic acid molecule of 
the present invention, or a fractional fragment, neutral mutation or homolog thereof in a 
postion whereby a heterologous coding sequence is under the control of and operably 
linked to the promoter sequence of the invention and is capable of expression in a host 
cell. 

The term "neutral mutation" means a mutation, ie a change in the nucleotide or 
polypeptide sequence such as by deletion, substitution, inversion or insertion, which have 
no effect on the function of the encoded promoter sequence. 

A fragment of a nucleic acid molecule according to the present invention is a portion of 
the nucleic acid that is less than full length and comprises at least a minimum length 
capable of hybridising specifically with a nucleic acid molecule according to the present 
invention (or a sequence complementary thereto) under stringent conditions as defined 
below. A fragment according to the present invention has at least one of the biological 
activities of the nucleic acid or polypeptide of the present invention. 

Nucleic acid probes and primers can be prepared based on nucleic acids according to the 
present invention eg the sequence of SEQ ID NO: 1 or 2. A "probe" comprises an isolated 
nucleic acid attached to a detectable label or reporter molecule well known in the art. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and 
enzymes. 

"Primers" are short nucleic acids, preferably DNA oligonucleotides 1 5 nucleotides or 
more in length, which are annealed to a complementary target DNA strand by nucleic acid 
hybridization to form a hybrid between the primer and the target DNA strand, then 
extended along the target DNA strand by a polymerase, preferably a DNA polymerase. 
Primer pairs can be used for amplification of a nucleic acid sequence, eg by the 
polymerase chain reaction (PCR) or other nucleic acid amplification methods well known 
in the art. PCR-primer pairs can be derived from the sequence of a nucleic acid according 
to the present invention, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5° 1991, Whitehead Institute for Biomedical Research, 
Cambridge, MA). 

Methods for preparing and using probes and primers are described, for example, in 
Sambrook et al. Molecular Cloning: A Laboratory Manual, 2nd ed, vol. 1-3, ed Sambrook 
et al. Cold Spring Harbour Laboratory Press, Cold Spring Harbour, NY, 1989. 



WO 00/01810 



PCT/NZ99/00107 



-8- 

Probes or primers can be free in solution or covalently or noncovalently attached to a solid 
support by standard means. 

The term "operably linked" means a first nucleic acid sequence linked to a second nucleic 
acid sequence when the first nucleic acid sequence is placed in a functional relationship 
with the second nucleic acid sequence. For instance, a promoter sequence of the present 
invention is operably linked to a coding sequence of a heterologous gene if the promoter 
affects the transcription or expression of the coding sequence, 

A "recombinant" nucleic acid is one that has a sequence that is not naturally occurring or 
has a sequence that is made by an artificial combination of two otherwise separated 
segments of sequence. This artificial combination is often accomplished by chemical 
synthesis or, more commonly, by the artificial manipulation of isolated segments of 
nucleic acids, eg, by genetic engineering techniques. 

Techniques for nucleic acid manipulation are described generally- in, for example, 
Sambrook et al. (1989). 

The ternv "stringent conditions" is functionally defined with regard to the hybridization 
of a nucleic acid probe to a target nucleic acid (ie to a particular nucleic acid sequence of 
interest) by the hybridization procedure discussed in Sambrook et al. (1989) at 9.52-9.55 
and 9.56-9.58. 

Regarding the amplification of a target nucleic acid sequence (eg by PCR) using a 
particular amplification primer pair, stringent conditions are conditions that permit the 
primer pair to hybridize only to the target nucleic acid sequence to which a primer having 
the corresponding wild type sequence (or its complement) would bind. 

Nucleic acid hybridization is affected by such conditions as salt concentration, 
temperature, or organic solvents, in addition to the base composition, length of the 
complementary strands, and the number of nucleotide base mismatches between the 
hybridizing nucleic acids, as will be readily appreciated by those skilled in the art. 

Detailed Description of the Invention 

As defined above, in its primary aspect, the present invention is directed to a novel 
promoter sequence of the myostatin gene and the uses thereof as a diagnostic test for 
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muscle cell disorders and genetic marker for the selection, in particular, of cattle/sheep 
breeds that express low levels of myostatin. The novel promoter sequence is also useful 
for expressing foreign genes or dominant negative forms of myostatin in muscle. 

5 The novel promoter sequence of the present invention comprises an isolated DNA 
molecule having a polynucleotide sequence of SEQ ID NO: 1 of Figure 1 or a fragment 
or variant thereof having substantially equivalent transcriptional activity there. 

The technology used to isolate the myostatin promoter from a genomic DNA library is 
10 described generally in Sambrook et al, "Molecular cloning", Second Edition, Cold Spring 
Harbour Press (1987). 

An initial step involves the use of a specific myostatin DNA probe followed by cloning 
in a host cell and subsequent amplification using PCR technology. 

15 

Preferably, the host cell in which the DNA sequence encoding the myostatin promoter is 
cloned is a prokaryote such as E. coli. Other prokaryotes can also be used, for example 
bacilli such as Bacillus subtilis and enterobacteriaceae such as Salmonella typhimurium 
or Serratia marcesans. 

20 

In general, where the host cell is a prokaryote, cloning vectors containing replication and 
control sequences which are derived from species compatible with the host cell are used. 
The vector may also carry marking sequences which are capable of providing phenotypic 
selection in transformed cells. For example, E. coli has commonly been transformed 

4 

25 using pBR322, a plasmid derived from an E. coli species which contains genes for 
ampicillin and tetracycline resistance and thus provides easy means for identifying 
transformed cells. 

In addition to prokaryotes, eukaryotic microbes, such as yeast may also be used, for 
30 example, Saccharomyces cerevisiae or cultured cells derived from multicellular organisms 
such as mammals and insects may be used as hosts. Examples of mammalian cultured 
cells which may be useful are HeLa cells and Chinese Hamster Ovary (CHO) cells, 
myoblasts, muscle cells, fibroblasts, and satellite cells. 

35 Suitable cloning vectors for use in mammalian cells include SV40 or other suitable viral 
vectors. 
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The DNA molecule of the invention can be contained within a DNA molecule isolated 
from an appropriate natural source (muscle, blood) or can be produced a$ intron-free 
cDNA using conventional techniques. cDNA is preferred. 

The invention also contemplates variants of the DNA molecule which differ from the 
native sequence by the insertion, substitution or deletion of one or more nucleotides. 
These variants can be made through selective synthesis of the DNA using an appropriate 
synthesizer or by modification of native DNA by, for example, site specific or cassette 
mutagenesis or by any other techniques known in the art. Such variants may be between 
60-95% homologous in the DNA sequence of SEQ ID NO: 1 of Figure 1 and have 
substantially equivalent function thereto. 

The production of DNA fragments, probes and primers is also well within the capabilities 
of the skilled worker. 

The DNA molecule may comprise a native bovine myostatin isolated from any 
appropriate natural source or can be produced in the form of a synthetic oligonucleotide 
where the size of the active fragment to be produced permits. By way of example, the 
Triester method of Matteucci et al, J. Am. Chem. Soc. Vol 103: 3185-3191 (1981) may 
be employed. 

Once obtained, the DNA molecule is treated to be suitable for insertion together with the 
selected control sequence into the appropriate cloning vector. To this end the DNA is 
cleaved, tailored and religated as required. 

Cleavage is performed by treating with restriction enzyme(s) in a suitable buffer. Any of 
the large number of commercially available restriction enzymes can be used as specified 
by the manufacturer. After cleavage, the nucleic acid is recovered by, for example, 
precipitation with ethanol. 

Tailoring of the cleaved DNA is performed using conventional techniques. For example, 
if blunt ends are required, the DNA may be treated with DNA polymerase I (Klenow), 
phenol and chloroform extracted and precipitated by ethanol. 

Re-ligation can be performed by providing approximately equimolar amounts of the 
desired components, appropriately tailored for correct matching and treatment with 
appropriate ligase (eg T4 DNA ligase). 
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In a further aspect, the present invention consists in a DNA construct comprising the DNA 
molecule of the invention, ie the bovine myostatin promoter, operably linked to a 
heterologous gene of interest such that the heterologous gene is under the transcriptional 
control of the bovine myostatin promoter. 

The constructs may comprise a bovine myostatin promoter operably linked to a 
heterologous gene of interest selected from the group comprising myogenic regulatory 
factors; myostatin and myostatin receptor; oncogenes; genes that regulate muscle growth 
and differentiation; the muscular dystrophy gene; and any other genes expressed in 
muscle. 

Similar techniques to those described above would be employed when making constructs 
comprising the myostatin promoter ligated to a foreign ie, heterologous gene which may 
be desired to be expressed in muscle cells either in vitro or in vivo. 

In vitro expression may occur in cultured muscle cells such as fibroblasts, HeLa cells or 
in cultured myoblasts or myotube cells using replicable transfer vectors suitable for use 
in the expression of the DNA constructs of the invention to produce a heterologous 
protein. 

The replicable transfer vector may be selected according to the host or host cell to be used. 
Useful vectors will generally have the following characteristics: 

(a) the ability to self-replicate; 

(b) the possession of a single target or any particular restriction endonuclease 
cleavage site; and 

(c) desirably, carry genes for a readily selectable marker such as antibiotic resistance. 

Two major types of vectors possessing these characteristics are plasmids and bacterial 
viruses (bacterophages or phages). 

Suitable eukaryotic expression vectors which may be employed include, for example 
pcDNA 1.1, and the expression product could be isolated and measured. 

In vivo expression of a construct of the invention could be brought about by known gene 
therapy techniques. For example the myostatin promoter could be inserted in the Moloney 
murine leukemia virus (MoMuLv), Harvey murine sarcoma virus (HaMuSv), murine 
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mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV). A number of 
additional retroviral vectors can incorporate multiple genes. By inserting the myostatin 
promoter into viral vectors, the vector could now be used for expressing muscle specific 
genes. 

* 

In the construction of a vector it is an advantage to be able to distinguish the vector 
incorporating the foreign DNA from unmodified vectors by a convenient and rapid assay. 
Reporter systems useful in such assays include reporter genes, and other detectable labels 
which produce measurable colour changes, antibiotic resistance and the like. In one 
preferred vector, the P-galactosidase reporter gene is used, which gene is detectable by 
clones exhibiting a blue phenotype on X-gal plates. This facilitates selection. In one 
embodiment, the P-galactosidase gene may be replaced by a polyhedron- encoding gene; 
which gene is detectable by clones exhibiting a white phenotype when stained with X-gal. 
This blue-white colour selection can serve as a useful marker for detecting recombinant 
vectors. 

Once selected, the vectors may be isolated from the culture using routine procedures such 
as freeze-thaw extraction followed by purification. 

< 

In a still further aspect, the invention provides a method of selecting animals which have 
low myostatin expression which includes essentially the use of the myostatin promoter 
sequence as a genetic marker. This could be accomplished by PCR amplification of the 
myostatin promoter using genomic DNA isolated from blood with the primers designed 
along the promoter sequence and scanning the genomic DNA for mutations. 

Non-limiting examples illustrating the invention will now be provided. 

It will be appreciated that the above description is provided by way of example only and 
variations in both the materials and techniques used which are known to those persons 
skilled in the art are contemplated. 

Example 1 

1. Cloning of myostatin gene promoter 

5 'upstream regulatory sequences of bovine myostatin gene was isolated from a bovine 
lambda genomic library (Stratagene) by the method of Sambrook et al (1989). The library 
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was screened with a 500 bp genomic DNA (-1 700 to - 1200) isolated by inverse PCR. The 
entire genomic clone was subcloned into pBluescript and sequenced. 

This PI-PCR procedure is a modification of the method described by K M Pang and D A 
Knecht (1997). A series of Sau3Al digests were prepared for use as templates for PCR. 
Sau3Al (New England Biolabs, USA) digests of 10-jxg aliquots of bovine genomic DNA 
were carried out at 2, 0.5, 0. 125, 0.03 1 , 0.008, 0.004, 0.002 U/fig DNA for 1 hour at 37°C 
The reactions were stopped by heating at 65°C for 20 min. Two micrograms of each 
reaction mixture were separated on a 1% agarose gel to check the extent of cutting. On 
the basis of extent of digestion, one partial digest (0.008 U of Sau3Al/^g DNA) was 
chosen for further work. The DNA was then ethanol precipitated and resuspended in lx 
ligase buffer to the final concentration of 5ng/nl. T4 DNA ligase (New England Biolabs, 
USA) was added to a concentration of 4U/^1 to carry out ligation at 22°C overnight. 
Ligation was then stopped by heating at 65 °C for 20 min, phenol and then chloroform- 
extracted, ethanol precipitated and resuspended in sterile water to a concentration of 50 
ng/jil. Two primers (5'- CTGCTCGCTGTTCTCATTCAGATC and 5'- 
ATCCTCAGTAAACTTCGCCTGGA) facing outwards from 5 * translated DNA sequence 
of bovine myostatin were used to carry out PI-PCR. PCR was carried out in 50 ^1 reaction 
volume containing 250 ng re-ligated DNA, 0.2 mM each dNTP (Gibco BRL, USA). 0.8 
\iM of each primer and 2 \il of Elongase enzyme mix (Gibco BRL, USA) in 1 x "B" buffer 
supplied by the manufacturer was added. PCR was performed for 35 cycles at: 94°C for 
30 sec, 55°C for 2 min, 68°C for 6 min. PCR product was separated on a 1% agarose gel 
and a 2-kb band was excised, purified with the Wizard PCR Preps DNA Purification 
System (Promega, USA) and cloned by the TA Cloning System (Invitrogen, USA). 2-kb 
insert was then sequenced using M13 forward, reverse, 5 '-GGCTGTATGTGACATGCG 
and 5 '-TG AACC ACTGC ACTCTCTTG primers. 

2. Construction of CAT reporter vector: 

3.3 kb of myostatin upstream genomic DNA was PCR amplified using bovine genomic 
DNA as template. The primers used in the PCR are as follows 
GGGGTACCCCAATTCCTGGGACAAATTCTCTA (FORWARD) 
GGGGTACCGGTTTTAAAATCAATACAATCT (REVERSE). The PCR (35 Cycles) 
conditions were 94°C for 20 Sec, 50°C for 30 Sec, and 72°C for 60 Sec. The amplified 
3.3 kb DNA was cloned into p CAT3 Promoter vector (Promega) as a Kpn I fragment 
using standard cloning procedures. This construct will be referred to as pMS 26. 
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3. Transfections and CAT ELISA 

Four micrograms of CMV-CAT (positive control), p CAT promoter (negative control) and 
pMS 26 vectors were transfected into actively growing C2C12 myoblast cultures using 
5 Fugene 6 (Roche Diagnostics) according to the manufacturers protocol. Twenty four 
hours after the transfection the C2C12 cells were made to differentiate in low serum 
medium for a period of forty eight hours. The cells were subsequently washed and 
amount of CAT protein in the cell extract was measured using a CAT ELISA kit (Roche 
Diagnostics) according to the manufacturer's protocol. 

10 

RESULTS 

3.3 kb of upstream sequences harbours myostatin enhancer sequences 

15 In order to identify the enhancer sequences that drive myostatin expression in a myogenic 
cell, a reporter CAT construct was made with the 3.3 kb of upstream genomic DNA 
(pMS 26) and transfected into C2C12 myoblast cell line. The pCAT promoter vector and 
CMV CAT constructs were used as negative and positive controls respectively in this 
assay. When cell lysate from transfected C2C12 myoblasts were assayed for CAT protein 

20 by ELISA, it was observed that the cells transfected with myostatin promoter construct 
(pMS 26) had a CAT content of 250 picograms/ml (Fig 2). Whereas in the cells 
transfected with a negative control vector (p CAT promoter vector) CAT protein content 
was 5 picograms/ml (Fig 2). 

25 Discussion 

Transcriptional regulation plays a crucial role in determining patterns of tissue specific 
gene expression during development and differentiation. During the early embryonic 
myogenesis, several transcription factors like MRFs (MyoD, Myogenin, Myf-5 and MRF- 

30 5), Pax-3 control the determination, proliferation and the differentiation of myogenic 
precursor cells (myoblasts) by positively regulating the transcription of muscle specific 
genes (Rudnicki and Jaenisch, 1995). Myostatin, a gene which is shown to negatively 
regulate the growth of muscle, is expressed early in embryonic myogenesis in somites and 
the expression is continued in adult axial and paraxial muscle (Kambadur et al., 1997). 

35 Furthermore, it is also shown that myostatin gene expression is differentially regulated in 
different axial and paraxial muscle (Kambadur et al., 1997). Here, we have isolated the 
upstream sequence of myostatin gene and have characterised the several muscle specific 
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DNA binding sequences in the upstream genomic DNA. Furthermore, we have also 
shown that 3.3 kb of upstream DNA can drive the expression of CAT gene in a myogenic 
precursor cells lines. The isolated upstream sequence has consensus sequences for a 
typical eukaryotic basal promoter comprising 'CAT 1 Box and TATA 1 box sequence 
(Cohen et al 1986, Wingender 1988). A consensus for CAT box (CAAATG) is centered 
at -206 bp where as consensuses for two TATA boxes are centered at -139 and -163 bp 
respectively which binds the well characterised basal transcription factor TBP (TATA box 
Binding Protein), an early step in the formation of pre-initiation complex (Kambadur et 
al., 1990) of RNA polymerase II mediated transcription. In addition, we have also 
discovered several f F boxes in the upstream sequences of myostatin (Lassar et al 1989). 
E box has been shown to bind to the basic helix loop helix (bHLH) transcription factors, 
myogenic regulatory factors like MyoD, Myf-5, MRF-4 and myogenin. These bHLH 
transcription factors bind to the promoters of several muscle specific genes through bHLH 
DNA binding domain and control the transcriptional activation of these genes (Rudnicki 
and Jaenisch, 1995). We have also found a binding site for another muscle specific 
transcriptional activator MEF-2 (CTAAAAATAAT) at -584 bps in myostatin promoter 
indicating that myostatin gene regulation may occur by several independent muscle 
specific transcription factors (Gossett et al 1989). 

Conclusion 

• A genomic clone containing 10.0 kb of upstream regulatory sequence of myostatin is 
isolated for the first time. 

• The enhancer sequences of the myostatin gene have been characterised. 

• Several E boxes and one MEF-3 binding site that are specifically observed in muscle 
specific enhancer sequences are also seen in myostatin enhancer sequences. 

It will be appreciated that it is not intended to limit the invention to the aforementioned 
examples only, many variations, such as might readily occur to a person skilled in the art, 
being possible without departure from the scope thereof as defined in the accompanying 
claims. 
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CLAIMS: 

1 . An isolated DNA molecule comprising the polynucleotide sequence of SEQ ID 
NO: 1 and which encodes the promoter region of the myostatin gene, or a 
fragment thereof, or variant thereof which has been modified by the insertion, 
substitution or deletion of one or more nucleotides, said fragment and variant of 
said polynucleotide sequence having substantially equivalent function thereto. 

2. An isolated DNA molecule encoding a promoter region of the myostatin gene, 
said DNA molecule being selected from the group consisting of: 

a) a DNA molecule that is at least 60% identical to a DNA molecule of SEQ 
ID NO: l;and 

b) a DNA molecule that hybridises under standard conditions to a DNA 
molecule of a). 

3. An isolated DNA molecule as claimed in claim 2, selected from the group 
consisting of: 

a) a DNA molecule that is at least 70-95% identical to a DNA molecule of 
SEQ ID NO: l;and 

b) a DNA molecule that hybridises under stringent conditions to a DNA 
molecule of a). 

4. An isolated promoter sequence comprising of at least a portion of the 
polynucleotide sequence of SEQ ID NO: 1 sufficient to drive expression of a 
heterologous gene operably linked thereto, or a variant thereof having 
substantially equivalent function thereto. 

5. An isolated promoter sequence as claimed in claim 4, wherein said promoter is 
tissue specific. 

6. An isolated promoter sequence as claimed in claim 5, wherein said promoter is 
specific for driving expression in muscle cells. 
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7. An isolated promoter sequence as claimed in any one of claims 4-6 comprising 
the bovine myostatin promoter or a fragment or variant thereof 

8. A recombinant cloning vector comprising the DNA molecule as claimed in any 
one of claims 1-3. 

9. A host cell transformed or transfected with the recombinant cloning vector as 
claimed in claim 8. 

10. An isolated probe comprising at least 12 consecutive nucleotides of the 
polynucleotide sequence of SEQ ID NO: 1, or the complement thereof. 

11. A recombinant DNA construct comprising the promoter sequence of any one of 
claims 4-7 operably linked to a coding sequence of a gene of interest. 

12. A recombinant DNA construct as claimed in claim 1 1 wherein the gene of interest 
is selected from the group consisting of myogenic regulatory factors, myostatin 
and myostatin receptors, oncogenes, genes that regulate muscle growth and 
differentiation, muscular dystrophy, and any other gene expressed in muscle. 

13. A vector containing the DNA construct as claimed in claim 1 1 or 12. 

14. A host cell transformed or transfected with the vector as claimed in claim 13. 

15. A method of cloning the DNA molecule as claimed in any one of claims 1-3 
comprising the steps: 

a) inserting the isolated DNA molecule into a suitable replicable cloning 
vector; 

b) transforming or transfecting a host cell with said vector in vitro; 

c) culturing host cells; and 

d) isolating cloned DNA molecule. 
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16. An isolated DNA molecule as claimed in claim 1 substantially as herein described 
or exemplified with reference to the accompanying drawings. 

17. An isolated promoter sequence as claimed in claim 4 substantially as herein 
described or exemplified with reference to the accompanying drawings. 

18. A recombinant cloning vector as claimed in claim 8 substantially as herein 
described or exemplified with reference to the accompanying drawings. 

19. A host cell as claimed in claim 9 substantially as herein described or exemplified 
with reference to the accompanying drawings. 

20. An isolated probe as claimed in claim 1 0 substantially as herein described or 
exemplified with reference to the accompanying drawings. 

21. A DNA construct as claimed in claim 1 1 substantially as herein described or 
exemplified with reference to the accompanying drawings. 

22. A vector as claimed in claim 13 substantially as herein described or exemplified 
with reference to the accompanying drawings. 

23 . A host cell as claimed in claim 1 4 substantially as herein described or exemplified 
with reference to the accompanying drawings. 

24. A method as claimed in claim 1 5 substantially as herein described or exemplified 
with reference to the accompanying drawings. 
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SEQ ID NO: 1 




G AATTCTTCTATGG C T ATC AC C ATG TC C A 


-10374 


AAACACTACCTGAGTAAAGAAGGAGTCATTTTTTTTTTAAGGGTTTAAAA 


-10324 


AATTTAAAAATTTTTAAATTAAATTTAAAATTGAAATCTAAAAAAATTTT 


-10274 


AAGAAAAATTTTTTTTAATCTGCAGAAACATACTCTGTATTGAGTCTATG 


-10224 


GTT A C C AAAGATC CC C C C AG AAC AAAAC C C C AAG AATTG C AAG ATTTTC T 


-10174 


C TTGGGAAG TG TTTGTTT AC TGG AAGG CTT ATT AACTC AATAATAAGGGA 


-10124 


GAGTAAAGACTTCAAAACTAAAGATCTGTTTTTGTCAGTTCAATCTATAT 


-10074 


TGTCATCTGCTCTGGAAACCCTGAGCTTGTTCTAAAGTAAACTGAACTAT 


-10024 


CATGAAGAAAATCAGCCATCAAAATAGTGAAAACGAACCTCTTACACTCA 


-9974 


GTTTTTTCTCATTTGTAAATTTAAAAAATTTAACTTCATTAAGTCTGGGG 


-9924 


ACCAATGTATAGCTGCAAGCAGGAATTTTATGCCTTCTTAATCCAGAGAG 


-9874 


GTGGGCAATGGTGTGGGTTTAGACCAGAAGTAAGGATGTGGGTGCAGCAA 


-9824 


ATATGACAGCCCAACTGCTGCTGCTGCTGCTAAGTCACTTCAGTCGTGTC 

* * 


-9774 


CGACTCTGTGTGACCCCAGAGACGGCAGCCCACCAGGCTCCCCCGTCCCT 


-9724 


GGGATTCTCCAGGCAAGAACACTGGAGTGGGTTGCCATTTCCTTCTCCAA 


-9674 


TGCATGAAAGTGAAAAGTGAAAGTGAAGTCGCTCAGTCGTGTCCGACCCT 


-9624 


CAGTGACCCCATGGACTGCAGCCTACCAGGCTTCTCCGTCCATGGGATTT 


-9574 


TCCAGGCAAGAGTACTGGAGTGGGGTGCCATTGCCTTCTCCATGACAGCC 


-9524 


CAACTAGAGACTGCTATAACTTATATCAGAGAATCTGAAACTCCCTACAG 


-9474 


GCTATCTTGCTATATGTGCATTCAACATGAACAAAAAGAAAATTCAAGTC 


-9424 


CAGTAAAAAGGGCTAAAATAGTGAATCACAGACTTTTGCCAAACAACAGT 


-9374 


CAATGATTTGGGCAAAGGACTTAAGTAGCTATGCTTATTTGCTATTAGTA 


-9324 


GAATAAAGAAATGTTTGAGCCTGGTTCTCAATGCTTGAATAATAAGATTC 


-9274 


CTTTTC AAGG AAAAACAATTTTTTTG AG AACTC AT AGATAAAAGTCAAAT 


-9224 


wmi^nui i in^iijiLAti i ^AAUCLAACTTGATTATAGTAAACAAATT 


-9174 


CAGCTGTAAACATTTTTTTATAACACTGTAGACACTGTATTTTTATACAC 


-9124 


TGTATAAATGAAGACACTGAAGATGTAAAGTGGTGACCTGGATGATCCAG 


-9074 
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AATATGCAaTATGCACCAAAAACTAAGTTTTTTAAATGATACAAATGAAC 


-9024 


TTACTTACAAAAGGGAAATAGAGTCACAGATGTAGAAAACATACAGTTAC 

* 


-8974 


y*^T • J - T ^1 J**% Tl ^M<% A jM^ jM^ Sfc, ^M^ VW, ^fc S» -^s_ - . 

CTAGGGGGACGGGGGGAGGGTAAACTGGGAGATAGGAATTGACACATACA 


-8924 


C TCTAC TATATATACAC TAGGTAAC TAATATGAAC C TACTGTATAGCGCA 


-8874 


CAGAACTCTACCCAATACTCTGTAATGACATTTATGGGAATAGAATCCAA 


-8824 


TV TV ^^T. TV ^^T TV /" T4 ^TTj Tk * ^T t mm « m vt ^ k mmm> _t te — — 

AAAGAGCAGATATGTGTATATGTATAAACTGATTCACACCCCTTGCCTTG 


-8774 


CCTAAACACACACAGACACACACAGACACATCCATTTTGAGATTTCTTTC 


-8724 


CTCCTCTTTGGCTACTTTTGACTCAGTTTGACCTAGATAGGACCTGTGCT 


-8674 


GGGTAAGGGCTTTGGCAGTCTTAAGTAGTGTCAGAGCAAGCACAGTCACT 


-8624 


v-C<wTAGCaAGGTCTTATTACTAGCTACAGAGTCTCTCTGTGCAGTCATAG 


-3574 


^TCACTGAGTGCAAAAGTCCAGTGCAATGTTTGAGTCCCATGCATGCTCT 


-8524 


v-ACAGAACTGAGAGAGGAGGCAAGGAGACAGGACCCTCCAGTGCTGGGAC 


-8474 


TCTCTACTGGGGTGAGCAAGAGGGAACCCAATAGAAATGCTGCGAACCAC 


-8424 


AGAGCCACTCAGAAAGCCTGTAAATATAAACATAACAACACATTATGAAT 


-8374 


4 

ATATAAGTaTAAGTATACCTACCAAAAAATCAGAAAGTTGAACTTTTTTG 


-8324 


TATTCAaGGGAACAGTCATTCATTTATCGGTTCAGTTATCAGTAAACACA 


-8274 


tttgggaactgtaaaaatctaatcagttcagttcagttcagtcgctcagt 


-8224 


v-GTGTCTCTTTGCGACTCCATGAATCGCAGCACGCCAGGCCTCCCTGTCT 


-8174 


ATCACCAACTCCCGGCGTTCACTCAGACTTACGTCCATCGAGTCAGTGAT 


-8124 


Gv-CATCCAGCCATCTCATCCTCTGTCGTCCCCTTCTCCTCCTGCCCCCAA 


-8074 


TCCCTCCCAGCATCAGAGTCTTTTCCAATGAGTCAACTCTTCTCACGAGG 


-8024 


TGGCCAAAGTACCAAGTCTTTGAGTCTAGTCTCTTTTTTCAATGGAGAAG 


-7974 


AAGAGGAAACCAAATTATAACTTAATTTTTATTCTTTGTATTACAAGTGT 


-7924 


ATAATTAATACACTGGAGTTTCCATTTCAGAAAGGAAGAAGAGAAATCAC 


-7874 


ATTTTGCAGCTTTCCTGAACTAATACAAAGAAATGCAGAAGTTTTGTTAA 


-7824 


CTGGGATATTATTATGACACGTTGCCATAATATGAATGTCATCATCTCAA 


-7774 


GACTGACCTGAAAACCAAAATAAAAAAAAAAAGAGAGAGAGAGAGAGAGA 


-7724 
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GAGAGAAAAAAAAGAAAACCCTCCAGACCAGATTTCAGTCTACCACTTGA - 7 6 7 4 

GCTGAC AAACATTGGCCAAATGTCCTCTACAGAACCTGTAAGTTAGTAGT - 7 6 2 4 

TGGTAATTATAAAACAATAAAGTATATATTTCTGCTCCTGGCAATAATTA - 7 5 7 4 

TGTATCAGTTAAAAAGTATACATTGTCTAAAAAATCACCATATTTGATGT - 7 5 2 4 

CTCATTAAAACAAAGTTACAAATATTAAGATGAGAGGACTTAAAGTTAGA - 7 4 7 4 

TGGGAAAATATTCAATTGAAGCAGTATAAAATGCATTACTCTGGGGCAAA - 7 4 2 4 

GTGTGGTCTGGAGATCCCTGGAGTGAAGACCCTTTTAGACAATCTGTGGA - 7 3 7 4 

GTAAGAACTGTTTTCATAACAAAGCTAAGATTTCCTTGCTATTCTCATTT - 7 3 2 4 

TCTCTTACGTATATAGTCGAGTTTTCCAGAAGTTCCATGTTATGTAACAT - 7 2 7 4 

CATCATTACTCTGTCAGCAAATAGAATACATGTTTGCATATGTCTATATT - 7 2 2 4 

CTAACATTTCTCATTTTTAATTTATAATTCATTAAATATTGATAGATATG - 7 1 7 4 

ACCCACATAAACAAAAGCTTTTC AGGATCCTGAATAACTTTTCAGAGTAA - 7 1 2 4 

AGGAATCTTGAGACCAAAAGGTTTGAGAATTACTGTTTTAGGGGATACAC - 7 0 7 4 

CTCAAGTGAAAGAGCACGCCTC ATCACATTTCCACTCTATATGGAAGAAT - 7 0 2 4 

CTAGAAGATTGAACCTATAATTG AAGAGTGCAGGCTTTATGAAGACAGTG - 6 9 7 4 

ACTGTTTCTGTCTGCTTTCTCCTCAGAATTTAGCATGATGCCAGGCACAT -6924 

AATTTGTTGTTGTTCAGCTGCTAAGTCATCTCTGACTTTTTGCAACCCCA - 6 8 7 4 

TGAACTGCATGCAACATGCCAGGTTTCTCTGCTTTCACTATCTCCTGGAG - 6 8 2 4 

TTTG CTC C AAC TC ATGTC C ATTG AG TC AC TGAT AC C ATCC AACG ATCTCA - 6 7 7 4 

TCCTCTATAAGACACATTATAGACATTAGAATATTTTCATAAAATAATAA - 6 7 2 4 

GTGAATTAATACAGCTGAAACTCAAACAGCATAGGGGTTACAAGTACCAA - 6 6 7 4 

CTCGCGTCCAAGTTGAAAATCCACATATAATCTTAAGGTCAGCCTTGGAT - 6 6 2 4 

AC ATGC ATATC CAAGGTTCC CCATC TGAGG ATTC AACC AAC CTCAG ATAG - 6 5 7 4 

TG TAGT A C TG C AG TAC AC ATTTAG TG AAAAATGTGC ATATAAG TGGAC C C - 6 5 2 4 

ATGCAGGTCAAACCTGCGTTTTCCAAGGGTCCACAGTACACACACACATG - 6 4 7 4 

CATACATGCATCTCTAAATGAAAGCTTTGCCATCTGACTTACTCAAGGTC - 6 4 2 4 

ACATAAAACGTCAGCGAGAAACCCAGAACTATATTCCAGATCTCTGTTCC - 6 3 7 4 

Fig 1 contd 



WO 00/01810 PCT/NZ99/00107 

4/10 



TATAC TG m T A P TP PPTP AP TP A & PPPT A P^rprppmmmmrnomm/-« tv mmmmmTv 


-6324 


iiLibl i b l A 1 1 bAGA l bAbAGATAATC AGATGTTGCCACTGTAGGAT 


-6274 


bbbAbbb 1L1 LA 1 bb lbTTATGTGAATTGAGCACTATCCAGTTTGTTTCT 


-6224 


P_P_P 'T ir P r P A A PTPT A 7\ rp/*> tv /-^ tv tv 7\ /-1 m/~* mm tv — — . m/-i ■* * — * — * ~* s-. — — . — . __ _ _ 

bbb 111 AAb ibl AA 1 b AbAAbAGTGTTATATCAAAGGGCTATCATCACAA 


-6174 


vjoruui 1 vj\j^/i/iVjAVj 1 vjA i b AbA 1 AAAATGCATCTTTCTCTCTTTTCCCAC 


-6124 


/vAL/ivjAL iLUAAi 111 is».AibAl 1 b ATCCTTATTCTAATTCTTCAGTTCA 


-6074 


bi ibAbi iLAbltLl ± bAb lb ATGTCCAACTCTTTGCCACCCCATGAATC 


-6024 


1 LAbLAAbtLAbbL b TCCCTGGTCCATCACC AACTCCCAGAGTTCACCC A 


-5974 


P A PTP A P P rPP P 7. TVT" 1 7\ /tp/t tv /-»m^- > mnn/i m>-«m M _ _____ 

bAb 1 LAbb ibbA I bbAbTbAGTGATGCCATCTAGCCATCTCATCCTCTGT 


-5924 


bbibL-bAiblLiiCCi GCCCCv-AATCCCTCCCAGCATCAGAGTCTTTTC 


-5874 


b aa 1 bAb x bAACTTTTCACATGAGGTGGCCAAAGTACTGGAGTTTCAGCT 


-5824 


r P r P A PP A T'P A p P r Pr^i r> T"T r^r^c * 7v 7\. 7\ tv tv tv rn/*-« — . « y— - _ _ 

1 l Abbr* 1 LATTCCTTCCAAAGAAATCCCAGGGCTGATCTCCTTCAGAATG 


-5774 


^au 1 u^i 1 1 bbA 1 CTCCTTGCAGTCCAAGGGACTCTCAAGAGTCTTCTCCA 


-5724 


A P* A P^P** A P* A P 1 'T^'T , /~ , 7\ 7\ 7 v 7\ /" • /" 1 tv rn/ - * t\ tv mm/ - tmr- * ^»>»n • • — , — ~ __. _ _ _ 

AbAbLALAb 1 1 CAAAAGCATCAATTCTTCGGCGCTCAGCTTTCTTCACAG 


-5674 


TPPA A P f PP r PPZ4PIi f rPP7i T>7a P« 7* T>r^" 7\ rr^^ ptv nr> -» -» ^, _. _, -« ^ ^ _„ . 

i bbnnb ibl bAbA l L l. A 1 AL ATGACCACAGGAAAAACCATAGCCTTGACT 


-5624 


AbAibbAbbi 1 ibl x bbbAAAb 1 AATGTCTCTGCTTTTCAATATGCTATC 


-5574 


T 1 A P^P^ r P m P >, P ,r PP > 7> m 7A 7A r - mrnpfimmr'n t\ ^ -mt - M — « • — - ,. 

. AVjb- x -LjUI lai AACTTTCCTTCCAAGGAGTAAGCATCTTTTAATTTCAT 


-5524 


LrbiL 1 ulau 1 LALLATTTGTAGTGATTTTGGAGCCCAGAAAAATAAAGTCT 


-5474 


1 bAbAb 1L1 1 iLCACTGTTTCCCCATCTTATTTCCCATGCAGTGATGGGA 


-5424 


C*C*Cl(~Z A r PPIP , P' A T'P' A T 1 ^ TTT^ 7\ prpmmmnmp tv tv my- mm -. n .n ■■■ ■ _ _ _ 

bbbbA ibbbAibAi L ± rAbTTTTCTGAATGTTGAGCTTTAAGCCAACTTT 


-5374 


TTP A A TPTPPTPTTTP A p ,^ ^ ^ ^'T»^ , a-ipa tv r* ?v ->/~«mmmmi-i --^ ^- „ 

1 1 iLiLLibi 1 1 bAb ill LATCAAGAGGCTTTTGAGTTCCTCTTCAC 


-5324 


m ^'T , P ,r PP!P , p 1, A T A A P , P , P ,r PP'P ,r PP , Ti/~^ 7\ rnr*mr>r*> > m-n m^-i-- « - - ^ 

- 1 1 b 1 bbbAiAAbbb 1 bb 1 GTCATCTGCATATCTGAGGTTATTAATATTT 


-5274 


CTCCCGGCAATCTTGATTCCAGCTTGTGCTTCTTCCAGCCCAGTGTTTCT 


-5224 


CATGATGTACTCTGCATAGAAGTTAAATAAGCAGGGTGACAATATACAGT 


-5174 


CTTGACATCCTCCTTTTCCTATTTGGAACCAGTCTGTTGTTCCATGTCCA 


-5124 


GTTCTAACTGTTGCTTCCTGACCTGCATACAGGTTTCTCAAGAGGCAGGT 


-5074 


CAGGTGGCAGGTCAGGTGGTCAGGAACATCTCTTTCAGAATTTTTGACAG 


-5024 



Fig 1 contd 



WO 00/01810 PCT/NZ99/00107 

5/10 



TTTATTGTGATCCACACAGTCAAAGGCTTTGGCATAGTCAATAAAGCAGA 


-4974 


AATAGATGTTTTTCTGGAACTCTCTTGCTTTTTCGATGATCCAGCAGATG 


-4924 


TTGGCAATTTGATCTCTGGTTCCTCTGCCTTTTCTAAAACCAGCTTGAAC 


-4874 


ATCAGGAAGTTCATGGTTCACGTATTGCTGAAGCCTGGCTTGGAGAATTT 


-4824 


AGAGCATTACTTTACTAGCATTACTTTTCACAATAAACTGTGGAAAATTC 


-4774 


TGAAAGAGATGGGCATACCAGACCACCGGATCTGCCTCTTGAGAAATTTG 


-4724 


C ATGC AGGTC AGG AAGC AACAATTAG AAGTGGAC ATGG AAC AAC AGAC TG ' 


-4674 


GTTCCAAATAGGAAAAGGTGTTCGTCAAGGCTGTATATTGTCACCCTGTT 


-4624 


TATTTAACTTCTATGCAGAGTACATCATGAGAAACGCTGGGCTGGAAGAA 


-4574 


GCACAAGCTGGAATCAAGATTTCCGGGAGAAATATCAATAACCTCAGATA 


-4524 


TGCAGATGACACCACCCTTATGGCAGAAAGTGCAGAGGAACTAAAAAGCC 


-4474 


CCTTGATGAAAGTGAAAGTGGAGAGTGAAAAAGTTGGCTTAAATCTCAAC 


-4424 


ATTCAGAAAACGAAGATCATGGCATCCGGTCCCATCACTTCATGGGAAAT 


-4374 


AGATGGGGAAACAGTGGAAACAGTGTCAGACTTTATTTTTCTGGGCTCCA 


-4324 


AAATCACTACAAATGGTGACTGCAGCCATGAAATTAAAAGATGCTTACTC 


-4274 


CTTGGAAGGAAAGTTATGACCAACCTAGATAGCATATTGAAAAGCAGAGA 


-4224 


CATTACTTTGCTAACAAAGGTCCATCTAGTCAAGGCTATGGTTTTTCCTG 


-4174 


TGGTCATGTATGGATGTGAGAGTTGGACTGTGAAGAAAGCTGAGTGCCGA 


-4124 


AGAATTGATGCTTTTGAACTGTGGTGTTGGAGAAGACTCTTGAGAGTCCC 


-4074 


TTGGACTGCAAGGAGATCCAACCAGTCCATTCTGAAGGAGATCAGCCCTG 


-4024 


GGATTTCTTTGGAAGGAATGATGCTAAAGCTGAAACTCCAGTACTTTGGC 


-3974 


CACCTCATGTGAAGAGCTGACTCATTGGAAAAGACTCTGATGCTGGGAGG 


-3924 


GATTGGGGGCAGGAGGAGAAGGGGGCGACAGAGGATGAGATGGCTGGATG 


-3874 


GCATCACTGACTCAATGGACGTGAGTCTGTGTGAACTCTGGGAGTTGGTG 


-3824 


ATGGACAGGGAGGCCTGGCGTGCTTTGATTCATGGGGTCACAAAGAGTCG 


-3774 


GACACGACTGAGCGACTGATCTGTCTCTCTCTTACTAGCATGTGTCTCCT 


-3724 


CCTTTTTTTGCCACATCATCAAACTCCTGGCAAACTTCACATTAATAACA 


-3674 
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1 1 1 bbbAbL 1 I ^AUAAluuAAACAGTGAAACCATTAATGTTTTTTGGGA 


-3624 


AATATGCT I TATACTCTCAATGTTGTTTTGAAACGCACACCCCCTCCCCT 


-3574 


GCCGCCTGGTGTTTGTAAGACAGTTGAGAGAAGTTTGCTTGCTACCTTAC 


-3524 


TTATGGTTACACAAACGTAAGGCCCCCTGAGTACAAAGAAGAACAGGGGG 


-3474 


AACGCAAACTTCAGGCCCTGTGAGGAGGGGCACTGGACTCCTGTGAGAAG 


-3424 


AAACTGCTTTCAAAGAATTCCTGGGAGAAATTCTCTATGCACTCATCCTA 


-3374 


GCAACAAAGTCCTGTCCGAAGTTAGGCCCGCAGCACCCACACGGCAGTGA 


-3324 


AGGTTCCTACTGCTGGTGAACCTTGCTGCTCCGAAGCCATAGGAAGGTTG 


-3274 


CAAATCCCvavaCACTGGAGAAGGAAAACACGTTCTTGAAATTTCTTGAGTA 


-3224 


CCTCTTAATTCATTCAATGCTGACCTCCGGAGATTGGATAGAGCTGACTC 


-3174 


TCATTATTCACAGTGGTTATGTTCTACCCAATCACTGCCAACATGAATAA 


-3124 


GTGATTCCTGAACCACTGCCCCTAGGGGAACTACAAGGTTAGATTCCCGT 


-3074 


CAGCCTCTGGTCACGTTTTTGTTAACCAATCAATAAATAACCTTGTTTTG 


-3024 


TGTGCATTTCTGTTTTAAAATATCTTTATTTAATACGTACTGCTAATTCT 


-2974 


TCAAC^TTTGGTTCACAACCAAAAGGCCCTATTAACTGGAAGCCCTGAAT 


-2924 


GAAGCTTACATAACACACATTGTTTTTCTATGAGGAAAAATTTTTCCTTC 


-2874 


AGTCCTGCCACAGCCTTCTTGCTTAAAATTGTGGACAAAATATACATAAC 


-2824 


ATGAAACTGACTATTATTTAACTGATTTTAACTGTACAGTTCAGTGGCAT 


-2774 


TAAGTACACTTACCTTGCTGTGGAACTATGATCAACATTCATCTCTAGAA 


-2724 


CTTTTTGATCTTCTCGAATTGAAAACTCTGCATCTATTGCACAGTAGCTT 


-2674 


TCCCCCaTCACACCGCACCCTTCCTCCAGCCCCCCGGCACCACCATCCTT 


-2624 


CTTTCCATGACAGTCATCCTGTGCCTAGGAACACAGCCCTTCAACACTAC 


-2574 


GCTTGGGGGGCACTGTAAGCAACAGGATCACTCCCTACCGCCACCAAATG 


-2524 


CACACAAAAATATAAAAAGCATGGTGGCATATCGATTGCAAAAAGGGTGC 


-2474 


TTGCTAAGTATGAGGGCTGAAACAAGGCAGAGAATTGACTAGGTTGACCT 


-2424 


CAGCTGGGATCCTGTGTGTTGGAAGCCTCAAATTTTCCATTGTTCTGTGC 


-2374 


ATACGCACAAATGCTTATAAAAGCACTGTAAGGATTGATTATGAAGTTAA 


-2324 
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GATAAATCTCAGCAAGACATAAATGTGCAAGCACGGGATCCATGAATAAC -2274 
GAGCACTGACCATGTGGAAATGATAATCTTTGTTTCCTTTATTCCAGGCA -2224 

GTAAGGAGAAAGCGCTCACAGGGCTGCCTTACACCATTTTACTAGAGAGC - 2 17 4 

TAGCCTATGTCAGTCGGTAGCTGGCAATTACAAACTGAAGCAGTTCTAGT - 2 1 2 4 

TCATGTGGAGGATGAATTTAACCATAATCTCAACCCCCTCTGCATGAAAC - 2 0 7 4 

AGAGACTAAGTACTCAAGTACCAGTTATCAGTCACTTACTATATGACAGG - 2 0 2 4 

CACTGTACTCAACAATTTACATGTATTATTGAATTACATGCCCCCAACAC - 1 9 7 4 

TCTATGAGGAAGCTGAAGGTTAGAGAAGTATCTCATTCATTATTACACAG - 1 9 2 4 

TGGCAAACTGAGATCTGAACTCAGGTCTATCCAACTCCAGGACCTGAGAT - 1 8 7 4 

CCCAATTGCTACACAATTCTAATC AAGTTAAAAGGGAAAAAGGATTTGAT - 1 8 2 4 

TTGCTCAGAAGTGTATAGGGGCATATGTTACAATTATAACATTACAAAGA - 1 7 7 4 

TTTATATGTTGAAAAATAAATTTATCAAACAAATAAAACTTTATAAGCCT - 17 2 4 

GATCTAATACTGCTCCGCAACAAAGACTATCTGAAATCCTTCAGGGCATC - 1 6 7 4 

TGGTTTGTGTCTGGTTTTCCTTAATCTTTAATGATGGGCAAATCTAATGC - 1 6 2 4 

ATTATGTAAGGCCATTTTTTCTCAAGAGATGTAGATACCTCTTAAGAATT - 1 5 7 4 

TGATGAAAATGCATTAACTTTTCAGGCTACTGAGTTGCATTTTAGTGCAC - 1 5 2 4 

TG AGG C AG T AAATTAG TGT AC AATG TG CG AAAG TAGTG ACC TAAAAAATA - 1 4 7 4 

AATATTTGATATGAACCACTGCACTCTCTTGGGGAAAAAAGTAATGG ATT - 1 4 2 4 

AACTCTCTTAGGAGTCCTTAGCTTCCCCAAAAGTAGTAGGAAAAATAAAT - 1 3 7 4 

CTCCTGTGGCCTGGAAACAGCTTCTGTTTCTTGCTGGCTATATTTGTTTA -1324 

GGTTTTTAATAGTTCATTTGATTAGACCTTGTGGCTCCCAAAGCTAAGGT - 1274 

E-box (4) 

TGAGAGTTTGATCCCTACAGAGGCC ACTTCAATTTAGAGAACAAAAAGCC - 1 2 2 4 

CCATTCTCTGCTCCCAGACCTTACCCCAAATCCCTGCCAGGTGTCTGCCC - 1 17 4 

TCCGGTCAAATGAGAAACTGGCAAAGGAAGTACTAGGAGGTCGCACAGTA - 1 12 4 
E-box (3) 

CTAGGAAGTAGAAAAATGG ACTAGC ACACTACTGAGAAGCAGAAAAATGG - 1 0 7 4 
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GCACCCTTCATGATGGTGTTCCTTTCCCTTTCTGTGTTCACAATGCTCCG - 1 0 2 4 

ATATAATTTACAGAGGGTAGATAACTACATTTTTTTCTTTTACCACTGGA - 9 7 4 

AGGCTGAGGAAAACTTTGTTACCCATCATAAAATTCACTATCTTCTAAGT - 9 2 4 

CATTCTATGTTATTCTAAGATCAAATAGCTGAC AATATCCTCTTTGTAAT - 8 7 4 

AAAC AATGAAAAAAC AC ATCC TC TG AGC AATATTAATCTG C AAC TTTAG G - 8 2 4 

ATAGGAAGTAACTTAATACTAGTCAATTGAAACTGAAATACAATTTTCAT - 7 7 4 

ATGAATAAAAG ATATTATTT AAAAG TAATTCC ATG AGC AATTTAATATT A - 7 2 4 

AAGTAGGATTTTCATTATGTGTTAAGAATTTATTCAGGGAAACAAGTTTC - 6 7 4 

TCAAATTATAGCAGAAAATCTTTTACTAGTATCACAGTCTTTTCATTTAA - 6 2 4 

GTCTTCCTGAATAAATCTGTATTTTCTAATTATACAAGACTAAAAATAAT - 5 7 4 

MEF2-site 

TT AATATAAC AAATAAAATTATTTTTAC TTCAAATGCTTA C TTAAATAGT - 5 24 

E-box (2) 

ATAAAATCATTTTATTTTCTGAGGGAAAAGCATATC AACTTTTTAAGTAT - 4 7 4 
TATA- box (3) 

GAAGTGTAAATTAAGATTTATTCACTTAAATTATAATTTTTAAAGTTTCA - 4 2 4 

CATATAAAGATGAATAAGATCTAAGTGTATATGTTATTGTTAATAAAGTT - 3 7 4 

TTTAATTTTTCGCATGTCACATACAGCCTTTATTATTCATAGATTTATTC - 3 2 4 

CTTTTAAGAAGTAGTCAAATGAATCAGCTCACCCTTGACTGTAACAAAAT - 2 7 4 

E-box (1) 

ACTGTTTGGTGACTTGTGACAGACAGGGTTTTAACCTCTGACAGCGAGAT - 2 2 4 

TCATTGTGGAGCAAGAGCCAATCACAGATCCCGACGACACTTGTCTCATC - 1 7 4 

CAAT-box 

AAAGTTGGAATATAAAAAGCC AC TTGGAATACAGTAT AAAAG ATTCACTG - 1 2 4 

TATA-box (2) TATA-box (1) 

GTGTGGCAAGTTGTCTCTCAGACTGGGCAGGCATTAACGTTTGGCTTGGC - 7 4 

GTTACTCAAAAGCAAAAGAAAAGTAAAAGGAAGAAGTAAGAACAAGGGAA - 2 4 
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AAGATTGTATTGATTTTAAAACCATGCAAAAACTGCAAATCTCTGTTTAT 



+ 27 



ATTTACCTATTTATGCTGATTGTTGCTGGCCCAGTGGATCTGAATGAGAA 



+ 77 



CAGCGAGCAGAAG +90 

FIG. 1. Nucleotide sequence of the 10 , 492-basepair promoter- 
enhancer region of bovine myostatin. Total length of the DNA 
sequence is 10,492 basepairs (bps) which consist of 10,402 
bps of upstream and 9 0 bps of coding sequence. The coding 
sequence of myostatin is underlined. Nucleotides and the 
relative position of putative nucleotide motifs are numbered 
with respect to the translation start site (+1 bp) . 
Consensus sequences for basic functional elements of a 
mammalian promoter (TATA and CAAT boxes) and for known muscle 
specific transcription factors (E-box and MEF2 sites) are 
bolded in the sequence and they are listed below. 

TATA -boxes [ 1 ] : -139 bps(i); -163 bps (2); 



524 bps (3) 



CAAT-box[2] : 



206 bps 



E-boxes [3] : 



308 bps (1) ; 



543 bps (2) 



1167 bps (3); -1309 bps ( 4 ) 



MEF2 site [4] : 



584 bps 
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